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ABSTRACT 


This  paper  presents  results  of  experimentation  aimed  at  identifying 
suitable  starting  rules  for  discrete  event  simulations.  A starting  rule  is 
a decision  rule  that  tells  a simulation  analyst  when  to  begin  collecting 
data  that  are  relatively  free  of  the  initial  conditions  of  a simulation.  The 
starting  rules  described  here  rely  for  decision  making  on  a comparison  between 
a priori  information  on  interarrival  and  service  times  and  corresponding 
sample  quantities  computed  during  the  course  of  a simulation.  Testing  of 
the  first  proposed  rule  on  a single-server  queueing  simulation  with  exponen- 
tial interarrival  and  service  times  revealed  a serious  inadequacy.  However, 
an  examination  of  just  how  this  inadequacy  arose  led  to  a second  proposal  for  a 
starting  rule.  When  tested  in  a parallel  simulation  the  second  rule  produced 
considerably  more  favorable  results.  In  addition,  a perusal  of  the  distri- 
bution of  starting  time  for  1000  replications  suggests  a direction  for  future 
research  aimed  at  reducing  this  starting  time. 


1.  Introduction 
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Among  statistical  problems  that  arise  in  the  course  of  running  a discrete 
event  simulation,  the  effect  of  initial  conditions  on  simulation  output  has  long 
occupied  a prominent  position.  Initial  conditions  refer  to  the  state  of  critical 
variables  at  the  beginning  of  a simulation  run.  Because  of  the  dependence  among 
phenomena  in  a simulation  and  the  temporal  nature  of  much  of  this  dependence, 
the  choice  of  initial  conditions  influences  the  time  paths  of  these  phenomena. 
Since  interest  traditionally  has  focused  on  steady  state  behavior,  research  on 
the  topic  of  initial  conditions  has  concentrated  on  ways  of  diluting  their 
influence  on  the  sample  records  used  for  analysis.  Gafarian,  Ancker  and 
Morisaku  (1977)  have  reviewed  and  evaluated  the  published  proposals  for 
solving  this  problem.  Their  findings  reveal  that  a viable  solution  remains 
to  be  uncovered.  The  present  paper  provides  a framework  for  research  into  this 
topic  and,  based  on  initial  findings,  reconmends  procedures  to  overcome  the 
problem  of  initial  conditions.  The  procedures  rely  on  ancillary  information 
that  is  available  to  the  simulation  user  and,  when  properly  used,  can  induce 
acceptable  behavior  in  sample  paths. 

The  analysis  of  queueing  systems  represents  the  most  coninon  use  of  discrete 
event  simulation.  To  begin  a simulation  run  of  a queueing  system  one  needs  to 
specify  the  state  of  congestion  that  prevails  at  that  moment.  Since  the  purpose 
of  inquiry  is  to  uncover  congestion  characteristics,  one  can  hardly  be  expected 
to  meet  this  need  with  anything  but  frustration.  Nevertheless,  there  is  one 
set  of  initial  conditions  whose  effect  on  time  paths  is  relatively  straightforward. 
If  a simulation  begins  with  an  arrival  to  a system  with  no  busy  servers  and  no 
queues  one  anticipates  that  the  system  will  appear  uncongested  near  the  beginning 
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of  the  run.  That  is,  queue  lengths  and  waiting  times  appear  shorter  than  one 
would  expect  in  the  steady  state.  Although  this  set  of  initial  conditions  leads  • 
to  undesirable  behavior,  its  ease  of  implementation  relative  to  other  initial 
conditions  gives  it  greater  appeal  in  practice.  The  procedures  to  be  described 
here  all  begin  with  this  initial  condition  which,  as  we  show,  can  be  made  to 
pose  no  special  problem. 

To  overcome  the  inadequacy  of  past  proposals  this  paper  presents  an  approach 
to  solving  the  initial  conditions  problem  that  uses  ancillary  information  avail- 
able to  an  analyst  before  and  during  execution  of  a simulation  of  a queueing 
system.  This  information  consists  of  arrival  and  service  rates  or  their  corres- 
ponding reciprocals,  mean  interarrival  and  mean  service  times.  The  principal 
idea  is  to  compute  sample  quantities  such  as  sample  mean  interarrival  and  ser- 
vice times  and  compare  them  with  their  corresponding  true  values.  When  the 
deviations  between  sample  and  corresponding  true  quantities  fall  within  prespeci- 
fied tolerances  one  begins  data  collection  for  purposes  of  statistical  inference 
about  the  steady  state.  Prior  to  beginning  the  experimental  part  of  this  study, 
the  rationale  here  was  that  making  sample  quantities  representative  of  their 
corresponding  time  quantities  would  make  the  state  of  the  system  at  the  time 
that  data  collection  commences  representative  of  the  true  steady  state.  As  our 
results  will  show,  putting  the  system  exactly  in  the  steady  state  exceeds  the 
capacity  of  our  suggested  rule.  However,  the  rule  does  enable  one  to  put  a 
system  into  a state  of  above  average  congestion.  This  alternative  capacity  has 
value  since  if  the  choice  lies  between  beginning  data  collection  In  an  under- 
congested or  overcongested  system,  most  analysts  would  prefer  the  overcongested 
situation.  This  Is  especially  true  If  the  purpose  of  the  analysis  Is  to  Infer 
congestion  characteristics. 
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Although  one  easily  appreciates  the  notion  of  a data  collection  starting 
rule  based  on  a comparison  of  sample  and  theoretical  values,  several  issues 
need  resolution  before  one  can  implement  such  a rule.  The  issues  include: 

1.  Which  quantities  should  enter  the  decision  to  start  data 
collection? 

2.  What  should  the  tolerance  be  for  the  comparison  of  sample 
and  theoretical  quantities? 

3.  What  explicit  form  should  the  starting  rule  assume? 

4.  For  which  quantity  does  one  wish  to  collect  data? 

5.  How  does  one  evaluate  a rule's  performance? 

6.  How  does  the  tolerance  affect  the  performance  of  the  rule? 

Although  the  present  paper  addresses  each  of  these  issues,  one  can.iot  claim 
answers  that  apply  across  all  conceivable  queueing  simulations.  Nevertheless, 
our  f'ndings  do  reveal  procedures  that  should  prove  useful  for  many  single 
server  queueing  systems  and  conceivably  for  some  multiserver  system  as  well. 

Section  2 presents  a discussion  of  the  aforementioned  issues  and  formulates 
rj'io  1 .experimental  results  for  which  appear  in  Section  3 with  evaluation.  The 
evaluation  leads  to  the  formulation  of  an  alternative,  rule  2,  empirical  results 
for  which  also  appear  in  Section  3 with  a demonstration  of  its  superiority  over 
rule  1.  As  the  reader  already  may  have  noted,  the  aforementioned  rules  allow 
for  a variable  number  of  observations  to  be  omitted  from  a sample  record  at 
the  beginning  of  a simulation  run.  Because  the  experimental  results  indicate 
that  this  number  may  be  large.  Section  4 formulates  rule  3,  a modification  of 
rule?  ,that  allows  for  a dramatic  reduction  in  the  number  of  unused  observations. 
The  efficacy  of  this  rule  remains  to  be  tested. 

2.  Problem  Formulation 

Consider  a simulation  model  of  a queueing  system  with  m servers  in 
parallel.  Independent  Interarrival  times  with  mean  1/x  and  independent  service 
times  with  mean  l/<o  . Let  T^  denote  the  elapsed  time  between  arrivals  of 
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jobs  i-1  and  i and  let  S.  denote  the  service  time  of  arrival  i . Assume 
that  the  simulation  begins  with  the  arrival  of  job  1 to  an  empty  queue  and 
m idl^  servers.  Let  X.  denote  the  system  time  of  completion  i where  system 
time  denotes  waiting  time  plus  service  time.  Assume  that  an  ultimate  objective 
of  analysis  is  to  infer  the  characteristics  of  the  system  time  stochastic 
process  from  a sample  record  of  system  times.  Moreover,  one  wishes  to  conduct 
this  analysis  free  of  the  undercongesting  influence  of  the  empty  and  idle 
initial  conditions.  Suppose  the  sample  record  is  • • • »^N+M 

M IS  specified.  Then  the  problem  at  hand  is  to  determine  N in  a simulation 
i^un  so  that  each  come  from  the  steady-state  probability  distri- 

Ou't- 1 On . 

Stfiecting  a Rule 

After  n completions  occur  during  a simulation  run  one  can  estimate  1/A 
-1  ” -1 

anc  l/(n  by  n T.  and  n [ S.  respectively.  These  estimates  are 

i=l  ’ i=l  ^ 

unbiased  and  independent  of  initial  conditions.  In  principle  the  closer  the 
sample  quantities  are  to  their  respective  means,  the  more  one  inclines  to  believe 
that  subsequent  ^n+r  ^n+2’'”  each  represent  steady-state  system  times.  Then 
one  conceivable  starting  rule  is 

n n 

N = min(n:  |j;  T.  - n/A|  s 6,,  17  S.  - n/a)|  s 6,)  0 < 6,,  6,  , 

i=l  1 1 ^ 

where  one  needs  to  specify  the  tolerances  5^  and  62  explicitly.  We  call 

I 

N the  starting  time  since  data  collection  starts  with  completion  N+1  . 

At  the  outset  of  inquiry  into  starting  rules,  working  with  two  tolerances 
would  be  difficult.  An  alternative  system  quantity  of  interest  is  the 
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activity  level  or  traffic  intensity 

(1)  p = arrival  rate/no.  of  servers  x service  rate 

= A/m  10  , 

for  wnich  one  has  an  estimate 


0 < 6 • ! 

Thl.  f'rst  starting  rule  that  we  test  is  closely  related  to  (2). 

For  fixed  n a considerable  amount  is  known  about  pp  . We  consider  | 

one  special,  but  important,  case.  Suppose  that  interarrival  times  have  a p 
stage  Erlang  distribution  with  mean  1/A  and  that  service  times  have  a q stage 
Erlang  distribution  with  mean  l/w  . Then  qp^/pp  has  an  F distribution 
with  2np  and  2nq  degrees  of  freedom.  Moreover,  j 


N N 

Pw  = I S./m  I 

i=l  ’ i=l  ' 

One  starting  rule  based  on  (1)  is 


(2; 


N = min(n:  i p^  - p|  5 ) 


E(pp)  = pnp/(nq  - 1) 

This  suggests  that  when  working  with  Erlang  distributions  one  use 

purposes  and  consider 

A 

Rule  1 N = min(n:  |Pp  ~ p|  l 6)  0 < 6 . 

We  note  in  passing  that  rule  1 will  not  be  the  ultimate  recommendation 
that  we  make  to  potential  users.  However,  a careful  analysis  of  the  results 
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for  this  rule  is  a prerequisite  for  guidance  in  identifying  a better  rule. 

This  we  do  in  Section  3. 

Selecting  a Tolerance  Level  6 

Let  us  now  turn  to  the  choice  of  6 . It  is  not  difficult  to  anticipate 
an  inverse  relationship  between  N and  6 . Also  one  expects  that  a smaller 
iS  should  make  the  steady  state  more  of  a reality  for  subsequent  system  times. 
Although  we  need  to  rely  on  experimentation  to  determine  how  6 affects  this 
closeness  to  steady  state  one  additional  observation  deserves  attention.  If 
one  were  to  choose  6 = ap  for  0 < a then  N would  have  a distribution 
independent  of  p . This  appears  contrary  to  intention  for  one  intuitively 
expects  initial  conditions  to  influence  less  for,  say,  p = .7  than  for 

p = .9  . Therefore  we  avoid  this  approach  to  setting  6 levels.  Section  3 
studies  starting  rules  for  6 = .0025,  .005,  .01,  .02,  .05  and  .1  . 

Criteria  for  Evaluating  a Starting  Rule 

One  now  needs  a method  for  evaluating  rule  1.  In  particular,  one  wants 
the  rule  to  produce  a sequence  of  steady-state  system  times. 

For  single  server  queueing  models  Lindley's  equation  tells  us  that  [4  ] 


^N+1  ^ S(^+i  • 


which  implies  that  is  a function  of  Xj^,  , and  S^^^^  . However, 

only  Xj^  conceivably  could  depend  on  the  initial  conditions  at  the  beginning 
of  the  simulation.  Therefore,  if  Xj^  does  not  have  the  steady-state  distribu- 
tion one  is  skeptical  that  Xj^^^  does;  if  Xj^^^  does  not  have  the  steady- 
state  distribution  one  is  skeptical  that  Xj^+2  Here  one  can  regard 

X|^  as  the  initial  condition  that  determines  the  extent  to  which  X|^^2*--- 

have  steady-state  distributions  and  Xj^^^  as  the  observation  most  influenced  by 
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this  initial  condition.  The  remainder  of  this  paper  concentrates  on  the 
distribution  of  as  the  criterion  of  evaluation. 

Tests  Used  for  Evaluation 

To  test  rule  1 we  use  a single  server  (m  = 1)  model  with  Poisson  arrivals 

and  exponential  service  times.  For  specified  a,  w and  6 we  compare  the 

empirical  cumulative  distribution  function  of  for  1000  independent  relica- 

tions  with  the  theoretically  known  steady-state  cumulative  distribution  function 

(c.d.f.)  , which  is  exponential  with  mean  p/(l  - p)  [4  ] . 

Let  Nj  denote  the  starting  time  on  replication  j and  Xj  the  system 

<1 

time  of  completion  N^+1  on  replication  j . Consider  J replications  with 

sample  data  X^  N +1  ’ ' ‘ ‘ ’^J  N +1  independent  and  identically  distri- 

* 1 * J 

buted  when  using  rule  1.  We  wish  to  test  the  hypothesis  Hg:  Xi  ^^,...,Xj 

* 1 ’ J 

have  the  c.d.f. 


F(x)  = 1 - e 


0 < X < “ 


Under  Hg  the  statistics 


j = 1,...,J 


are  independent  uniform  deviates  with  c.d.f. 


G(y)  * y 


0 < y ^ 1 


The  empirical  c.d.f  of  the  Y.'s  is  for  all  y 

V 


■ J C '(0. 
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I being  the  indicator  function.  To  check  Hq  one  examines  the  deviations 

A(y)  = Gj(y)  - y 0 < y < 1 . 

or  functions  of  these  deviations.  For  the  Kolmogorov-Smirnov  goodness-of-fit 
test  one  uses  the  test  statistic 

D*  = sup  lA(y)l  . 
y 

Critical  values  of  the  test  statistic  appear  in  Miller  (1956)  and  Owen 
(1962).  For  the  chi-square  goodness-of-fit  test  with  K equiprobable  cells 
the  statistic  is 


P K P 

X = JK  I [A(i/K)  - A(i/K  - 1/K)]^ 
i=l 

whicn  for  large  J has  the  chi-square  distribution  with  K - 1 degrees  of 
freedom  under  Hq  . For  the  Anderson-Darl ing  test  [ 1 ] one  uses  the  statistic 

1 

= n /i[A(y)]^/y(l  - y)}  dy  , 

0 


which  is  particularly  sensitive  to  departures  of  Gj(y)  from  G(y)  in  the  tails 
of  the  distribution.  Critical  values  of  the  test  statistic  appear  in  Lewis  [ 5 ]. 

The  aforementioned  statistics  test  for  departures  from  Hq  . In  many  cases, 
including  the  present,  it  is  of  interest  to  study  the  nature  nf  the  departure  if 
Hq  is  rejected.  For  us  the  most  desirable  alternative  is  to  show  that  X,  ^ .i, 

I I 

•••.Xj  HI  have  a distribution  that  is  stochastically  greater  than  the  steady- 
* J 

state  time  distribution.  Let  W and  V be  random  variables  with  c.d.f. 's  F,.  and  F,, 

W V 
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One  says  that  W is  stochastically  greater  than  V if  - Fy" 

is  non-negative  and  not  identically  zero  on  the  open  interval  (0,1)  , 
Fy  ^ and  Fy  ^ denote  the  right  continuous  inverses  of  F^^  and  Fy 
In  the  present  case  this  neans  that  the  system  times 

1 * J 

exhibit  more  congestion  than  steady-state  system  times  would  show. 

The  Kolmogorov-Smirnov  test  allows  one  to  check  the  hypothesis 
^r'  ^1  N +1’”'’^J  N +1  ^ distribution  that  is  stochastically 

greater  than  the  F in  (4).  The  test  statistic  is 


D’  = -inf  A(y) 

y 

Owass  (1962)  describes  an  additional  helpful  measure  of  discrimination 

statistic 

1 

0 

gives  the  proportion  of  G(y)  that  lies  below  G(y)  = y . Under  Hq 
the  uniform  distribution  on  (0,  1)  . If  is  true  one  expects  U 
close  to  unity. 

3.  Experimental  Results 

This  section  begins  with  a presentation  of  results  for  rule  1 for 
time  in  a M/M/1  queueing  simulation  with 


(0=1 

X = .7  and  .9 

6 * .0025,  .005,  .01,  .02,  .05,  .1 


where 

respectively. 


The 


U has 
to  be 

system 


J = 1000  . 

Each  of  the  1 x 2 x 6 = 12  design  points  was  replicated  J = 1000  times  and 
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on  each  replication  of  each  design  point  and  pj  were 

0 J 

recorded.  To  check  on  the  correctness  of  the  simulation  1000  replications 

with  = 1000  for  j = 1,...,1000  were  run  for  each  design  point.  On 

each  replication  system  time  was  recorded. 

Table  1 gives  the  goodness-of-fit  statistics  for  the  empirical  c.d.f.'s 

of  Xj  for  each  of  the  design  points.  For  tests  of  size  a = .05  the 

0 

critical  values  of  the  Kolmogorov-Smirnov,  Anderson-Darling  and  chi-square 
tests  are  .0430,  2.492  and  30.1  respectively.  The  results  are  dismal,  Hq 
being  rejected  in  all  cases  but  the  fixed  starting  time  case  of  N = 1000  . 

This  last  observation  merely  confirms  the  fact  that  initial  conditions 
hardly  affect  completion  1001  for  p = .7  and  .9  . To  obtain  insight  into 
the  nature  of  the  deviations,  plots  of  A(y)//y(l-y)  were  prepared  for  each 
c.d.f.  in  Figures  1 and  2 in  the  Appendix.  Notice  that  for  p = .7  the  curves 
of  the  deviations  become  negative  as  6 decreases.  By  contrast  for  p = .9 
the  curves  become  more  positive  as  6 decreases. 

Although  initially  puzzling  the  cause  of  poor  performance  becomes  apparent 
in  Figure  3.  It  reveals  that  small  system  times  usually  occur  with  large 
Pj^'s  (small  p/(p  + P|^))  and  large  system  times  occur  with  small  Pj^'s 
(large  p/(p  + p^^))  . The  explanation  of  behavior  is  now  apparent.  When  a I 

large  satisfies  rule  1 it  usually  does  so  by  entering  the  acceptance  interval  | 

[p  - 6,  p + 6]  from  above.  This  implies  that  completion  N has  either  | 

a short  service  time  or  a long  interarrival  time.  From  (3)  it  is  clear  that 
either  of  these  manifestations  inclines  to  reduce  Xj^i  through  . Conversely, 

A 

a small  pj,^  that  satisfies  rule  1 usually  enters  the  acceptance  interval  from 

^F(Xj^^^)  , where  ( 4 ) defines  F , maps  system  time  onto  (0,  1)  , as  does 
the  transformation  p/(p  + pj^).  | 

! 
I 
! 

1 
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Table  1 

Goodness-of-Fit  Statistics 
for  the  Distribution  of  X.. 
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below.  This  Implies  that  completion  N has  either  a long  service  time  or  a 
short  interarrival  time,  manifesting  a larger  than  average  . 

An  appropriate  partition  further  clarifies  this  issue.  Let  A(a,  b)  = [a,  b] 
for  a ^ b . Consider  the  starting  rules 


Rule  la  N = min(n:  C A(p  - 6,  p)  and  p^.  ^ A{p,  p + 6)  , i = l,...,n-l) 

Rule  lb  N = min(n:  p^  C A(p,  p + 6)  and  p^  ^ A(p  - 6,  p)  , i =1  ,...,n-l) . 


Together  these  give  rule  1.  Figures  4 and  5 show  deviations  for  those  of  the 
1000  replications  that  satisfy  rule  la  and  Figures  6 and  7 display  deviations 
for  the  remaining  replications  that  satisfy  rule  lb.  The  clarification  is 
relatively  iimediate.  As  6 decreases  rule  la  results  in  negative  deviations 
and  rule  lb  results  in  positive  deviations.  A little  thought  provides  a 
plausible  explanation.  In  order  for  rule  la  to  apply  on  replication  j the 
service  time  for  completion  N.  tends  to  be  larger  than  average.  Consequently, 

J 

the  associated  system  time  is  larger  than  average.  The  net  effect  is  that  the 
distribution  of  system  times  based  on  rule  la  is  stochastically  greater  than 
the  steady-state  distribution  of  system  time.  A one  sided  test  of  the  Kolmo- 
gorov-Smimov  statistics  with  a = 0.05  for  rule  la  easily  confirms  this  obser- 
vation for  all  6 and  p = .7  and  .9  . Moreover,  the  U*s  provide  additional 
support.  By  contrast  rule  lb  produces  system  times  whose  distribution  is 

stochastically  less  than  the  steady-state  distribution. 

To  summarize  briefly,  rule  1 cannot  guarantee  a steady-state  distribution 

as  6 decreases.  However,  rule  la  produces  a stochastically  greater  than 

distribution  as  6 decreases.  This  behavior  implies  larger  than  expected 

system  times  so  that  rule  la  leads  to  an  overcongested  system  at  the  beginning 
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of  data  collection.  Since  the  objective  is  to  measure  congestion  we 
regard  this  as  a reasonable  situation  to  induce  when  steady  state  is  unob- 
tainable. 

To  recorrmend  rule  lb  would  be  somewhat  shortsighted.  Clearly  if 
p^.  C A(p,  p + 6)  for  i < n one  cannot  realize  the  conditions  of  the  rule. 

To  overcome  this  inadequacy  consider 

Rule  2 N = min(n:  ) p^  - p | ^ 6 and  p^  > p^_^ ) . 

This  rule  allows  for  starting  data  collection  only  when  p^  demonstrates  an 

A 

increase  over  p^_^  . But  one  expects  this  to  happen  when  completion  n+1  has  a 

longer  service  time  or  shorter  interarrival  time  than  average  thereby  contribu- 

ting to  an  above  average  system  time. 

For  each  value  of  ,>  and  cs  1000  replications  were  run  using  rule  2 and 
was  recorded.  Table  1 shows  the  goodness-of-fit  statistics.  Also,  for 
a = 0.05  and  p = .7  a test  of  the  Kolmogorov-Smirnov  statistic  D"  supports 

the  assertion  that  rule  2 has  induced  a "stochastically  greater  than"  distribu- 
tion for  . Fcr  p = .9  D'  supports  this  type  of  behavior  only  for 

6 = .0025  . Figures  8 and  9 show  the  deviations. 

On  the  basis  of  the  accumulated  empirical  evidence  to  date,  one  inclines 
to  recommend  the  use  of  rule  2 with  6 =.0025.  Although  we  do  not  quarrel 
with  this  recommendation,  this  advice  should  be  regarded  as  a temporary  measure 
on  at  least  three  grounds.  Firstly,  we  have  no  experience  with  p > .9  . 

Secondly,  we  have  no  experience  with  multiserver  systems.  Thirdly,  the  sample 
quantiles  of  starting  time  for  rule  2 and  6 = .9  in  Table  3 are  cause  for  concern. 
Notice  that,  although  90  percent  of  the  starting  times  are  less  than  3099, 
one  percent  exceeds  48334.  In  our  opinion  the  risk  of  excessive  cost  is  far 
too  great  to  regard  rule  2 as  an  end  in  itself.  Therefore,  research  on  starting 
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rules  that  produce  shorter  starting  times  is  in  order.  Section  4 outlines 
prospective  research  on  this  extension.  The  motivation  for  the  approach 
relies  on  an  empirical  observation  that  as  6 decreases  the  correlation 
between  N and  becomes  small.  Although  this  observation  needs 

careful  study,  the  preliminary  evidence  encourages  us  to  consider  its  impli- 
cations for  starting  rules. 


Table  3 

Sample  Quantiles  of  Starting  Time  for  Rule  2 

P = .9 

pr(N  <_  n)  = p 


n 


.95 

.98 

.99 

141 

242 

457 

335 

644 

759 

1193 

2350 

4274 

2268 

6981 

10592 

6318 

14098 

29718 

9670 

27512 

48334 

maximum 

1 

mean 

536 

33 

1861 

74 

10230 

248 

56953 

642 

193990 

1586 

525550 

3471 

4.  An  Iterative  Rule 

The  observation  that  and  N may  be  uncorrelated  for  small  6 suggests 

an  approach  to  effecting  the  same  result  that  rule  2 induces  at  a considerably 
smaller  cost.  Suppose  X^^^^  and  N are  independent  and  let 

dj  = pr(N  < j) 


r 1 

I 
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I 

i 

based  on  rule  2.  Also  define  the  local  sample  activity  level 

N j 

I s. 

p . j=ml+l  J N - ml  - 1 I 

I.N  ■ N ’ N - ml  ; 

I T.  1 

j=ml+l  ^ ^ 

i 

where  m is  a positive  integer  and  I is  the  integer  part  of  (N-l)/m  . 

I 

I This  sample  activity  Ipvel  applies  for  the  M/M/1  and  may  require  an  adjust- 

j ment  in  its  correction  factor  for  distributions  other  than  the  exponential.  j 

! Consider  the  rule 

I 

I 

Rule  3 N*  = min(n:  |p,  - p|  < 6 and  p,  > pt  _ 

I.n  ''I—  ''I,  n-1 

and  ml  / n-1 ) . 

In  words  this  rule  requires  us  to  use  a sample  activity  level  based  on  at  | 

most  m past  completions.  The  quantity  I denotes  the  number  of  times  we  ' 

need  to  reset  p j ; i.e.,  the  number  of  iterations.  A little  thought  shows 
that 

pr(I  " 1)  = (1  - q )’  q„  i = 0,1 

m m 

Then  I has  a geometric  distribution  with  mean  (1  - q )/q  and  variance 

^m  m 

2 

(1  - qjjj)/q^  . Also,  the  mean  number  of  completions  E(N*)  required  to 
meet  rule  3 satisfies 

m(l  - q )/ q < E(N*)  < m/ q . 

^m  m — ^m 

Now  a user  may  choose  m to  suit  one's  convenience.  However,  from  the 
optimality  viewpoint  one  prefers  the  m that  minimizes  m/q  . If  several 

IT) 
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id's  lead  to  identical  minima  then  one  prefers  the  largest  among  them  since 

2 

this  minimizes  var  (ml)  = (1  - quj)(m/q|j^)  . Naturally,  the  efficacy  of 
rule  3 remains  to  be  verified  by  careful  experimentation.  If  this  verifica- 
tion is  realized  we  will  have  a considerably  more  desirable  starting  time 
rule  than  rule  2. 
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